EXPONENTIAL SUMS OVER DEFINABLE SUBSETS OF FINITE FIELDS 
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Abstract. We prove some general estimates for exponential sums over subsets of finite fields 
which are definable in the language of rings. This generalizes both the classical exponential 
sum estimates over varieties over finite fields due to Weil, Deligne and others, and the result of 
Chatzidakis, van den Dries and Macintyre concerning the number of points of those definable 
sets. As a first application, there is no formula in the language of rings that defines for infinitely 
" many primes an "interval" in Z/pZ that is neither bounded nor with bounded complement. 

o 
o 

Ch ■ 1. Introduction 

Exponential sums are ubiquitous in analytic number theory, in various shape and forms. A 
basic type is a sum 



X 



(1) S f (M,N)= e (/( n ))' 

M^n<M+N 



. 

where e(z) = e 2j?rz and / is some real-valued function. These tend to arise naturally in any 
asymptotic counting problem, as ways to express the secondary terms after isolating a "main 
term" and the basic goal is to establish some form of cancellation, of the type 

(2) Y, e(f(n))«N6(N)-\ 

(N| 1 M^n<M+N 

\q | where the saving 9(N) from the trivial bound N is a positive increasing function with 9(N) — » 

+oo as iV — » +oo. Evidently, it must be the case that / varies "fast enough" for such an 
estimate to hold. 

Various highly ingenious methods have been developed to deal with the distinct possible 
types of phase functions /; the names of Weyl, van der Corput and Vinogradov in particular 
are attached to the most classical ideas (see e.g. |IK1 §8]). It was however discovered that this 
type of analytic questions could sometimes be attacked using highly involved algebraic tools: if 
the interval of summation is of the type ^ n < p, where p is prime, and if f(n) = g(n) /p, where 
g is a polynomial or a rational function, the best general results come from an interpretation 
as an exponential sum over the finite field Z/pZ. 
Indeed, one introduces the "companion" sums 



s v = £ 



Tr/(x)) 
p 



for v ^ 1, where F p v is a field with p v elements, Tr : Y p v — > F p = Z/pZ being the trace 
map. Although S u , v 2, never (?) has any interpretation in analytic number theory, it is the 
properties of the generating function 

Z(T) = expQT^r 

which are fundamental in understanding the original sums. In this context, this was first 
recognized and developed by A. Weil, who proved for instance that for a fixed (non-constant) 
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function g £ Z[X] one has 

for all primes p and ^ ^ 1 (with possibly few well-understood exceptions), with an implied 
constant depending only on g. See e.g |IK| §11] for a description of the elementary approach 
of Stepanov and |IK| §11.11] for a first survey of the more advanced cohomological methods of 
Grothendieck, Deligne, Katz and others. 

In terms of applications to analytic number theory, it is clear that the potential of the more 
advanced results has not yet been fully exploited; there are a number of reasons for this, not 
only the complexity of the algebraic geometry involved (although that is certainly a factor), but 
also the difficulty of bringing a natural problem to a position where the Riemann Hypothesis for 
varieties over finite fields can be applied successfully: the reader need only look at the proof of 
the Burgess estimate for short character sums (see e.g. |IK| 12.4]) to see what ingenuity may be 
required; also the comments in |IK| 11.12] explain how the question of uniformity in parameters 
and "flexibility" in the shape of the sums can be crucial matters. 

In this paper, we describe a new general estimate for exponential sums over finite fields 
which combines quite efficiently the cohomological methods (as "black-box") and some results 
and techniques of logic to give estimates where the summation set in the finite field is much more 
general than the algebraic sets that are usually considered. We hope that this added flexibility 
will make it suitable for applications to analytic number theory; also the statement is, in itself, 
quite elementary with very few conditions, and this may also make it appealing to readers 
without a great experience in algebraic geometry (at the price of learning, or remembering, a 
few notions of logic.) 

See Section |31 for the general statement, which is preceded by a section recalling the precise 
formulas permitted as summation conditions. As a sample result, for the very important case 
of a sum in one variable, one gets the following: 

Theorem 1. Let <p(x) be a first- order formula in the language (0, 1, +, — , •) of rings. 1 For every 
ring A, let 

<p(A) = {x £ A | (p(x) holds}. 

Let f , g £ Q(^") be rational functions with f non-constant. Let N ^ 1 be the product of 
primes p such that f modulo p is constant. Then there exists a constant C ^ 0, depending only 
on ip and the degree of the numerator and denominator of f and g such that for any prime p 
and any multiplicative character x modulo p we have 

(3) | £ x{9(x))e{^)\^C{p,N)y 2 ^p-. 

x£<p(Z/pZ) P 
f(x),g(x) defined 

Compared to the classical sums above, the point is that the summation condition can be 
quite complicated, involving arbitrary entanglements of quantifiers (in first-order predicates, 
i.e., applied to elements of the field). One may also wonder if in fact the bound is really non- 
trivial (what if the number of points is usually of size p 1 / 4 , for instance?), but in fact, as proved 
in |CDM| and as we will explain again in detail below, the number of points of summation is 
either ^ i or or ^ cp, for some A ^ 1 and c > depending only on the formula <p. And one 
should keep in mind that if this were applied to a problem of analytic number theory, whether 
this is efficient or not would most often be obvious from the final result anyway. 

Notation. As already mentioned, we denote e(z) = e 2mz . We denote by F q a finite field 
with q elements and usually p is its characteristic. For a finite set X, \X\ denotes its cardinality. 
By / <C g for x £ X, or / = 0(g) for x £ X, where X is an arbitrary set on which / is defined, 
we mean synonymously that there exists a constant C ^ such that |/(:c)| ^ Cg(x) for all 
x £ X. The "implied constant" is any admissible value of C. It may depend on the set X 
which is always specified or clear in context. For notation and conventions concerning logical 



We recall the precise definition in Section |5] 
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formulas, see the beginning of the next section. We use elementary scheme-theoretic language 
for our algebraic geometry (see e.g. |Hal II]); in particular, an algebraic variety over a field F or 
over Z is simply a separated scheme of finite type over F or Z, and in fact only affine schemes 
will occur (so separatedness is automatic); so a variety is not necessarily reduced or irreducible. 
We write either Va or V/A to indicate that a scheme is defined over a ring A. (The choice of A 
is sometimes important to indicate precise dependency for constants that occur.) 

2. Definable sets 

Since the paper involves a fairly unusual mixture of analytic number theory and logic (also 
algebraic geometry, but the latter is kept essentially inside the proofs), we start by recalling 
what are precisely the formulas which define the summation sets we will consider. Of course 
logicians can skip this section without loss, unless they wish to make sure that the author does 
not speak utter nonsense. 

A term in the language (0, 1, + , — , •) of rings is simply a polynomial / G Z[a?i, . . . , x n ] with 
integer coefficients, where the x% are variables; an atomic formula (p is a formula of the type 
f = g where / and g are polynomials (possibly involving distinct sets of variables). Given an 
atomic formula ip, a ring A, and assignments of elements X{ = a% G A to the variables involved, 
the formula <p(a) with a = (aj) substituted for the variables is satisfied in A, denoted 

A |= ip(a) 

if the equality which "is" ip holds when the variables are given the values cij. 

Next the (first-order) formulas in the language of rings are built from atomic formulas by 
induction using the additional logical symbols -i, A, V, and quantifiers 3, V: atomic formulas 
are formulas by definition and if ip and ipi, i G I, are formulas, with / finite, then 

-itpi f\pi \J pi 

iel iei 
are also formulas, and if x is any variable then 

3x p and Vx (p 

are also formulas. Implication — ► and equivalence «-> are defined as abbreviations: 

ipi — ► (p2 means (-^i) V tp 2 , 

ipi <-> ip 2 means ((pi —> p 2 ) A (p 2 — > (pi), 

(and parenthesizing can be introduced for clarity). 

In an obvious way, the relation A |= (p(a) defined for an atomic formula is extended to 
any formula by induction using the usual meanings of the symbols: A as "and" , V as "or" , -i 
as "not". The quantifiers are always extended to elements of A only (not to subsets, not to 
elements of other rings than A). For instance the torsion subgroup of invertible elements in A 
is not definable in the first-order language of rings since the exponent can not be bounded a 
priori. 

Given a formula tp(x, y), where x = (x±, . . . , x n ) and y = (y±, . . . , y m ) are (disjoint) tuples of 
variables, and given a ring A and y G A m , we put 

(4) ip(A, y) = { X GA n | A |= (p(x, y)}, 

in other words, (p(A, y) is the set of n-tuples in A which satisfy the formula tp for the given 
value y of the parameter. Such a set is called a definable set (with parameters); if m = (no 
parameters), then the set is also called 0-definable. 

If p is an atomic formula / = g, the assignment A \— * p(A) is simply the functor that 
associates its ^4-valued points V(A) to the scheme V = Spec(Z[X]/(/ — g)) over Z given by 
the equation / = g. So in general definable sets can be seen as a (substantial) generalization of 
algebraic varieties. 

For instance, the set of squares in A is 0-definable by the formula 3y x = y 2 , but the 
assignment A i— > <p(A) is not a functor (e.g. because an element can become a square after 

3 



extension to a larger field, which can not happen to points of schemes). Here is more complicated 
example: for d ^ 1 an integer, the set of irreducible polynomials of degree ^ d in -A[-X~], identified 
with a subset of A d+l by the coefficients, is ©-definable (for every j, k ^ 1 with j + k ^ d, write 
existential quantifiers on coefficients of two polynomials of degree j and k with product equal 
to the given one). 

See e.g. |Hol 1.3,2.1] or |FJ1 6.1] for more details (involving more general languages) and more 
examples. 

A final notation: to define a formula (f(x), we will sometimes write 

ip(x) = "x satisfies such and such property", 

(and will either explain or leave to the reader to check that the property thus stated in informal 
manner can be written as a first-order formula in the language of rings), or 

(p(x) : ipi(x) . . . 

to indicate that the formula (p(x) is defined to be the expression after ":", which will usually be 
a combination of various bits and pieces; for instance 

<p(x) : (By x = y 2 ) V (By x = y 3 ) V (Vy Bz y 2 + x = z 2 ). 

3. Exponential sums over definable sets in finite fields 

This section defines the exponential sums we want to consider. We generalize from the 
context described in the introduction by introducing formulas with parameters <p(x, y) where 
x = (x%, . . . , x n ), n ^ 1, are the variables and y = (yi, . . . , y m ), m ^ 0, are the parameters. This 
formula is still assumed to be in the first-order language of rings. As in the previous section, 
we let 

<p(A,y) = {x e A n | A |= tp(x,y)} 

for any ring A and parameter y £ A m . 

We consider especially finite fields F q with q elements. Assume that for all q in some subset 
of the powers of primes we have chosen an additive character 

i/> : F q -+ C x 

and a multiplicative character 

X : F* - C x 

(which of course depend on q, although it is not indicated in the notation). We extend x to F q 
by putting x(0) = 0, except when \ 1S t ne trivial character, in which case x(0) = 1- 

Let /i, /2, gi, 52 £ ZLY] be polynomials in X = (X%, . . . ,X n ), with /2 and gi non-zero, 
and let / = /1//2, g = 51/52 (as rational functions). We assume that for all the q under 
consideration the formula (p(x,y) satisfies 

(5) F q \=(<p(x,y)^(f 2 (x)g 2 (x)^0)), 

and if necessary, this can be achieved by replacing (p(x,y) by the formula 

ip(x,y) A (-./ 2 (x) = 0) A (-.0,(1) = 0), 

which "restricts" to the points which are not poles of / and g. This may introduce a further 
dependency of the results below on f 2 and g 2 , but that is of course not surprising, and it will 
be clear that such dependency is really only in terms of the degree of f 2 and g 2 . 

Now finally we introduce the following general exponential sums over a definable set: 

(6) S(y, V ,F q )= Y, Tp(f(x))x(9(x)), 
for all q for which the data is defined. 
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These generalize the more classical exponential sums over the F^-points of an algebraic variety 
V/Z, which corresponds to the case where the formula <p(x,y) is the conjunction of the atomic 
formulas which "are" the equations of V. In that situation we also denote 

(7) S(y,V,F q )= Hf(x)) X (g(x)). 

xeV(F q ) 

The natural goal is to describe a non-trivial upper bound for S(y,ip,F q ) when possible, as 
explicit as possible in its dependencies. For applications to analytic number theory, it is natural 
to look primarily at the so-called "horizontal" case, i.e., when F q = F p is the prime field, and 
our statements are skewed to this case (assuming for instance that p is large enough, instead of 
q large enough, for some condition to hold). 

Here are two fairly simple examples that follow easily from our results. 

Theorem 2. With data as described above, assume that the additive characters ip are non- 
trivial. There exist constants C ^ and n > 0, depending only on the formula (f(x, y) and the 
degrees of the polynomials f\, f'2, gi, 52 such that for any prime p and any parameter y G F™, 
we have 

x&<p{F v ,y) x£tp(F p ,y) 

unless there exists c £ F p such that 

\{x£<p(F p ,y) I f(x)=c}\> V \<p(F p ,y)\. 

The result is stated in this manner in order to make quite clear what the saving is (i.e., 
about p 1 / 2 ) compared to the trivial estimate. We will also recall during the course of the proof 
the main theorem of CDM which gives the approximate value of the number of points of 
summation. The condition for the estimate to hold is sufficient but not necessary (see below 
for examples). It states intuitively that if there is no cancellation, then / must be constant on 
a subset of <£>(F p ,y) which has "positive density". 

This condition may seem difficult to check but in applications it should be the case that 
the "degenerate" cases are fairly obvious, and can be dealt with separately. The second result 
takes a different approach and is a "baby" version of the stratification results of Katz-Laumon 
(compare with [FK ). 



Theorem 3. Let ^(x) be a formula without parameters with x = (x\, . . . ,x n ). Define the 
additive characters ip by ip(x) = e(Trx/p), and let \> fi> 9i be as above. There exist constants 
C ^ 0, D ^ 0, depending only on (p(x) and, in the case of C , on the degrees of fi, gi such that 
for all primes p, all h = (h\, . . . , h n ) £ Fp except at most Dp n ~ x exceptions we have 

I E x( 9 ( X ))e( m + klXl + --- + KXn )\<c(l + p-^ £ l). 

(Intuitively, a sum may not exhibit cancellation, but most of its "twists" will; adding 1 on 
the right-hand side takes care of those p where |<^(F p )| is small). 

These theorems will be derived from more precise "structural" results concerning the expo- 
nential sum and its companions, which are of independent interest and may be useful for deeper 
studies in some cases. 

Before going into details, a very important remark is that we have not found a new source of 
cancellation in exponential sums: the saving will come by application of the Riemann Hypothesis 
for varieties over finite fields (Deligne's theorem). However, the point is that we have a very 
general result, with great flexibility, and with intrinsic uniformity in parameters, so Theorem [21 
has the potential of being a very efficient "black-boxing" of Deligne's result. Certainly in tricky 
cases it might not be easy to perform the reduction to varieties over finite fields explicitly. 
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Remark 4. We have not stated results with purely multiplicative sums. Although they can be 
analyzed by the method of Sections |S] and H3 this tends to reveal trickier aspects which make 
simple statements for a general summation set hard to state (in other words, it seems one needs 
a deeper understanding of the structure of ip(F q ,y)). We will give a few examples in the next 
section. 



4. Examples 

We will now give a few simple examples of the exponential sums we have in mind, and 
make some general comments. We make forward references to the structural results of the next 
sections, as the examples illustrate various aspects involved. 

Example 5. The simplest example of a formula defining a set which is not a variety is probably 
the formula 

ip(x) : 3y x = y 2 

which characterizes the squares in a ring. Thus, if the characteristic p is odd, we have 

\tp(E q )\ = i±± 

and since x \— > x 2 is two-to-one except at 0, for any complex numbers j3{x) defined for i£F, 
we have 



2 2 ' 

x£ip(F q ) XGFg 



If j3{x) = 4>(f(x))x(g(x)) where ip (resp. x) is an additive (resp. multiplicative) character, 
and /, g two polynomials, the second sum is still of this type, but over the points in F q , so it 
can be analyzed in the standard way. 

More generally, one can do similar things for any formula of the type 

3y x = h(y) 

where h is another polynomial, except that one must handle the various possibilities for the 
number of solutions y in F q to the equation h(y) = x: this kind of step is very clear in the 
structure result (Theorem |H}. 

Example 6. We next show that multiplicative characters can be "tricky" (as my son Nicolas 
would say): consider the formula <p(x) in one variable 

3yx 2 + l=y 2 

and take g(x) = x 2 + l and x the multiplicative character of order 2 for all finite fields, non-trivial 
ifp^ 2. Clearly 

E m*))= E 1 

so there is never cancellation. In this case, the reduction in Theorem |H1 below applies with 
K = 1, s = 0, r = 0, k = 1, $i(x,y) = tp(x) and hi t i(x,z) = x 2 - z 2 + 1 (see (fT3|)). e = 2, so 
V = V\ = A\, W = Wi t i is the affine conic with equation X 2 -Z 2 +l = and r = T\ t \ : W —> V 
is given by X. The function gor is, on W, equal to X 2 + 1, and so is the square of the function 
Z on the conic. This is quite obvious, but indicates that some knowledge of the structure of the 
definable sets is required to state a precise criterion for cancellation in a multiplicative character 
sum. Still, this knowledge need only be gained once and may then be applied for many different 
sums. 

Another variant is the following: take the formula in two variables 

tp(x, y) : 3z (x 2 - y 2 + l) 2 + z 2 = 0, 
and consider as before the quadratic character sum over cp(x,y) with g(x) = y 2 — 1. 
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So we introduce the affine variety W with this equation and the projection W — ► A 2 . This 
variety is not absolutely irreducible: adjoining a square root e of —1, it splits as 

((A 2 - Y 2 + 1) - eZ)((X 2 - Y 2 + 1) + eZ) = 0. 

Hence if —1 is a square in F q , there is always a value of z for each (x, y) and the sum is 

E rf + l)=9E* 2 + l)«9 3/2 , 



(i,y)£F, S/6F. 



which gives some cancellation, while on the other hand if —1 is not a square in F q , the points 
of W(F q ) must have z = 0, hence x 2 = y 2 — 1 and the sum becomes 

E x(* 2 ) 

xGF q 
x 2 +l is a square 

which is degenerate as in the previous case, although g is not the square of a function on Wp , 
but only on the intersection of its absolutely irreducible components. 

Another amusing example is as follows: consider variables (a, b) and let <p(a, b) be the formula 

Vx x 2 + ax + b ^ 0. 

For a field F of characteristic ^ 2, we have -F |= tp(a,b) if and only if the discriminant 
A(a, 6) = a 2 — 46 is not a square in F. Hence if we take ip trivial, x °f order 2 and g(a, b) = 
A(a, 6), the sum 

(a,6)G^(F 9 ) 

has no cancellation. This is only "trivial" if one knows about discriminants of quadratic poly- 
nomials, and one can guess that situations involving invariants of more complicated algebraic 
forms will lead to examples which are much less obvious. 

Because of this and the lack of current application, we have not tried to give a cancellation 
criterion for multiplicative character sum in this paper, but we hope to investigate this problem 
further. 

Example 7. Here is a challenging example: let n ^ 1 and consider the formula ip(a), a = 
(do, ai, . . . , a n _i), which expresses that the polynomial 

(8) X n + a^X™- 1 + • • • + ai X + 1 

is irreducible and oqcli - ■ ■ a n _i = 1. Consider then the following exponential sum 



K* = V 

ae</j(F p ) 



Ofl + • • • + Gt n _l 
P 



which is thus a subsum of a hyper-Kloosterman K n ^ p sum in n variables (which only includes 
the condition ciq • • • a n _i = 1 in the summation). Then we have 

(9) |a; p |« p "-v 2 , 

where the implied constant depends on n. Compare this with Deligne's estimate ( |D1| Sommes 
trig. §7]) 

\K n>p \ ^ (n + l)p n / 2 . 
To prove Q, it suffices, according to Theorem |21 to show that the function 

/(a) = a H h a n _i 

is not constant for a positive proportion of a 6 </?(F p ). This follows from the following two facts: 

(1) a positive proportion of a £ F™ with a® - ■ ■ a n _i = 1 define an irreducible polynomial 0; 

(2) / is not constant on a positive proportion of a £ F™ with ao • • • a n _i = 1. Of these, (2) 
is clear (if one wishes, it is a consequence of Deligne's estimate for hyper-Kloosterman sums!), 
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and (1) follows by interpreting the desired number as 1/n times the number of elements in F*„ 
which are of norm 1 and are of degree n over F p (i.e. do not lie in a smaller field). 

Here is a natural question: can the estimate for K* p be made more precise? Is the exponent 
n — 1/2 optimal? 

5. Geometric decomposition of exponential sums over definable sets 

Given a formula y>{x, y), we start by describing a "geometric" decomposition of the definable 
sets ip(F q ,y) and any sum over (p(F q ,y), following |(II)Mj with some more details. Then in 
the next section we will apply the sheaf-theoretic and cohomological methods to express the 
exponential sums in terms of eigenvalues of the Frobenius operator on suitable cohomology 
groups. 

It will be noticed that the notation is somewhat involved, since a large number of parameters 
occur here and below. In the Appendix, we give a list of most of them. 

Theorem 8. Let (p(x, y) be a first-order formula in the language of rings with variables x = 
(xi, . . . , x n ) and parameters y = (yi, . . . , y m ), n ^ 0, m ^ 0. 
There exist the following data, depending only on <p(x,y): 

(i) Integers K ^ 1, s ^ 0, e ^ 1; 

(ii) A prime power qo; 

(iii) For k ^ K , formulas $ ^ (x, x' , y) with auxiliary parameters x' = (x' l5 . . . , x' s ); 

(iv) For k ^ K and i ^ e, affine schemes W Kj i of finite type over Z with maps 

7T K>i : W K>i -> A z +m , r K , 4 : W Kji -> A^; 

with the following properties: 

(1) For every finite field F q with q ^ qo, there exists x' G F q such that 

(10) (p(x, y) «-> ($i(x, x', y) V • • • V $^(x, x, y)) 
/or every y G F™. 

(2) For every field F, every (x',y) G p s + m ^ ^ e se £ s c[> K (F, x', y) ; n^K, are disjoint. 

(3) For all finite fields F q with q ^ qo and (x', y) G Fg +m u>ii/i x' chosen so that (ITU)) holds, 
we have 

(11) x G W K) i(F g ) imp/ies r Kji (x) G $ re (F g ,x',y) 

and at most e elements of W K ^(F q ) satisfy r K; j(x) G $ K (F g ,x',y). 

(4) For all finite fields F q with q ^ qo and (x',y) G Fg +m with x' chosen so that (fTTTf) holds, 
and any complex numbers f3(x) for x G ip(F q ,y), we have 

(12) £ /3(x) = £ £ /3(r Kll (x)). 

xS3> K (F„a;',3/) lsji<e ' xeW K ,i(F 9 ) 

■K K ,i(x)={x' ,y) 

Proof. We follow the reduction steps of |CDM1 p. 123]. This provides us with the following 
data: 

• Integers K ^ 0, s ^ 0, r ^ 0, k ^ 0, e ^ 1, and a prime power go; 

• For each integer k ^ K, polynomials f K) ±, . . . , / Kjr G "Z[X, X' , F] where X' = (X{, . . . , X^) 
is an s-tuple of auxiliary parameters; 

• For each integer k ^ K, polynomials h K j G Z[X, X' , F, Zj] for 1 ^ j ^ k; 
which depend only on the formula </?(x, y) and have the following properties: 

• For every finite field F q with q ^ qo, the formula (p(x,y) is equivalent to the formula 

if' (x, y) = ($i(x, x', y) V $ 2 (x, x', y) V • • • V $^(x, x, y)) 
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where & K (x,x',y) is the formula 



(13) $ K (x,x',y) = (j KjX (x,x' ,y) = A • • • A f K , r (x, x' , y) = 

A (3z 1 ■■■3z k h K>1 (x, x' , y, z\) = A • • • A h K>k {x, x', y, z k ) = 0)) , 

where x' E is some value of the auxiliary variables (see below for a short explanation). 

• For any field F and parameters (x',y), the sets $i(F, x', y),. . . , <&k(F, x' ,y) are disjoint 
in F n ; 

• For each k ^ K and (x',y) E F^ +m , the number of z = (zi, . . . , z k ) E F k such that 
F \= *ff K (x, x', y, z) is bounded by e, where ^ K (x,x',y,z) is the formula 

(14) ^ K (x, x', y, z) = (f K ,i(x, x', y) = A • • • A f K , r (x, x', y) = 

A h K) i(x, x', y, zi) = A • • • A ft.«,fe(x, x', y, z k ) = o) . 

(Precisely, this summarizes the discussion on p. 123 of CDM : (|13|) is the conjunction of 
the formulas (1) of loc. cit., its precise form given in (3) in loc. cit.; the disjointness of the 
& K (F q , x' , y) is stated after (2) in loc. cit.; the existence and property of e is given in (4) in loc. 
cit.) 

Clearly, all this gives already the data of K, s, qo and the auxiliary formulas & K (x, x' , y) such 
that (1) and (2) of the theorem hold. Before continuing to the second part, we explain the 
occurrence of the auxiliary parameters x': they correspond to extra symbols q in the language 
of enriched fields discussed in |CDM, §2]. For any field F, those constants are interpreted as 
coefficients of some irreducible monic polynomial in -F[T]. To see the relevance with definable 
sets <p(F q ,y), notice that for instance the universal formula 

(15) Vi a + ait + a 2 t 2 / 

can be restated, for a finite field F q , by the existential formula stating that aQ + a\t + a2t 2 splits 
in linear factors over the field F 9 2, with no root in F q . This can be expressed in terms of the 
coefficients of an irreducible monic polynomial f 2 = co+ciT+T 2 by factorizing a2(T— y\){T— y 2 ) 
and then expressing F^-rationally the equality 

a + a x t + a 2 t 2 = a 2 (t - yi)(t - y 2 ) with y u y 2 E F q 2 - F q 

in the basis (1, a 2 ), where a 2 is a root of f 2 . 2 It is also useful to mention that it is in constructing 
<E> K that the restriction to large enough finite fields is introduced, this coming from a result of 
van den Dries, based on the Lang- Weil estimate and some model theory (see |CDM1 2.4]). 

Now, to simplify notation for the proof of (3) and (4) of the theorem, which concern the for- 
mulas & K {x, x' , y) individually, we drop the subscript k and we incorporate the new parameters 
x' into y (so that s + m is now denoted m). 

Let V denote the zero set of the polynomials /i , . . . , f r E Z [X, Y] , seen as a closed subscheme 
of A^" 1 "" 1 . We have the projection V — > given by the parameter y, and we denote by V y the 
fibers. Let W denote the common zero set in p^ rm+k Q f trig polynomials and the polynomials 
h\, . . . , h k . Denote by it : W — ► V the obvious projection. 

For j ^ 1, denote 

W y (F q ) j = {xe$(F q ,y) | \7r- 1 (x,y)nW(F q )\=j}. 

Let now f3(x) be arbitrary complex numbers defined for x E &(F q ,y). By ()13|) . for any 
x E $(F g ,y), we have (x,y) E V(F q ) and 7r _1 (x,y) n W(F q ) / 0, so that 

E 0(*) = E E ft*) 

xe$(F q ,y) j^l xeWyiF^j 

2 Alternately, 11511 can be approached by Galois theory; this would lead to the use of the Galois stratification 
method described in |FH.T) . 
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and from the defining property of e, this reduces to 

(16) £ 0(x) = £ £ /?(*). 

To deal with the fact that the fibers of ir do not necessarily all have the same cardinality, we 
now use the same combinatorial procedure as in |CDM| p. 124], although we make the result 
more explicit. 

For 1 < j < e, let Wj denote the intersection in p^ mJr i k Q f the j'-fold fiber product of W 
over V with the open subscheme Uj consisting of points (x, y, z%, . . . , Zj) where all the Z{ are 
distinct /c-tuples, i.e. in point terms we have 

Wj(A) = Uj(A) n {(x,y,zi, . . . ,Zj) | (x, y, Zi) G W(A) for 1 < i < k} 

for any ring A (this corresponds to the formula tyj(X, Y,Z,..., Z^) of loc. cit.) On Wj we 
have the maps 

f Wj — > AS* , f Wj -> A5 

...,Zj)^y y{x,y,zi . .. i-> x; 

we will denote by Wj tV the fiber of ttj over y. All this (with the omitted dependency on n) gives 
the data (iv) of the Theorem and (3) is satisfied by construction. 

Next we claim that the following combinatorial formulas hold: denoting 

= i(i -!)••• + =j\{\ 



we have for 1 ^ j ^ e 



(17) E(% E E E 

»=j ieW,(F,), (i^a «j)elVi(F,) ieW jl9 (F,) 

Indeed, for each x G Wj,(F g )i, the set 7r _1 (x,y) n W(F g ) has i elements and for j ^ i ^ e, any 
ordered subset of length j naturally gives a point of Wj^Fq). All of these points are distinct, 
and of course there are precisely of them. 

Now we use the following elementary lemma (presumably standard in combinatorics): 

Lemma 9. Let e ^ 1. Let Xj, y^ be complex numbers defined for 1 ^ i, j ^ e, such that 

e 

(18) ^2(i)jXi = yj, 

i=j 

for 1 ^ j ^ e. Then we have 

E = E ^4— 

Proof. To see this, solve first the triangular system (fT%|) using = — j)! to get 

(-l)*+i 

(as easily checked using the binomial theorem), and then sum over i to get 

E»-E E - E E A 

_ (-1)3+1 

= E — a— % 

by the binomial theorem again. □ 
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Applied to 

Xi= ^ X )' y 3 = ^2 P{Tj{x)), 

x£W y (F q )i xeW j>y (F g ) 

we get by (JTHJ) and (JT7J) that 

E = £ ^( £ fee*))), 

which is the final conclusion (|12|) . taking account the change of notation made before. □ 

6. Estimates arising from the decomposition of definable sets 

For the second part of the reduction of exponential sums over definable sets, recall that given 
a prime power q and an integer k ^ 0, a complex number a G C is a (/-Weil number of weight 
k if a is an algebraic integer, and any Galois-conjugate a' of a (i.e., any root of the minimal 
polynomial P G Z[T] of a) satisfies \a'\ = q k / 2 . It will be convenient to call a signed (/-Weil 
number a pair (±1, a) of a sign and a (/-Weil number. Usually we just write a and the sign 
is denoted e{a). When a Weil number is written down explicitly as a complex number and 
claimed to be a signed Weil number, this means that the sign is +1. For instance, this applies 
to a = q w , a (/-Weil number of weight 2w. 

To compute and estimate the exponential sums S(y,(p,F q ), we will apply the Grothendieck- 
Lefschetz trace formula and the Riemann Hypothesis over finite fields proved by Deligne to the 
auxiliary varieties ir~](x',y) C W Ky i arising from Theorem El This naturally falls in two steps: 
first, we examine the number of points of summation, then we analyze when an exponential 
sum exhibits cancellation. 

We will first perform both steps for a single W K) i )V , i.e., for a "classical" exponential sum 
(in logical terms, one corresponding to a positive quantifier free formula). The only subtlety is 
that we do not know if W Kj i jV is absolutely irreducible or not (the case usually treated in the 
literature), which affects the precise counting of points and the non-triviality of the estimates. 

Proposition 10. Let W C be an affine subscheme, let q be a power of the prime p, 

C0> X; /) 9) data defining the exponential sum 

S(y,W,F q )= ^U^))x(9(x)) 

(x,y)eW(F q ) 

for y G F™ over W . 

(i) There exist an integer B\ ^ 0, depending only on W , and for all y G F™, there exists an 
integer S(y), ^ 5{y) ^ n, and signed q-Weil numbers ctj(y) for 1 ^ j ^ (3 ^ B\, where (3 may 
depend on y, such that 

w{oij{y)) ^ 2n, maxw(<x,(y)) = 26(y), and aj(y) = q s{ - y) ifw(a>j(y)) = 25(y), 

and 

(19) ]T 1= £ e(aj(y)) aj (y). 

(x,y)eW(F q ) 

(ii) There exist an integer B2 ^ 0, depending only on W and the degree of f and g, and for 
all y G F™ there exist q-Weil numbers Pj(y), for 1 ^ j ^ 7 ^ B2, where 7 may depend on y, 
such that 

w({3 j (y))^28(y) for allj, 

and 

(20) £ i>(f(x)) x (g{x)) = <fy(y))PM- 

(x,y)eW(F q ) 
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Moreover, there exists 77 > 0, depending only on W/Z such that if p is large enough, p ^ po 
where po depends only on W and the degrees of f\ and fi, we have w{(5j{y)) < 25(y) for all j 
unless either ip is trivial or there exists c £ F q such that 

(21) 1 >* E L 

(x,y)eW(F v ) (x,y)GW(F q ) 
f(x)=c 

Proof. We denote by W y the fiber over y of W/F q . For reasons that will become clear soon, 
we first replace W y by the subscheme of Ap^~ m obtained by performing the decomposition- 
intersection process (described in |CDMl §1]), i.e., we replace W y by V y such that 

(i) W y (F q ) = V y (F q ); 

(ii) The absolutely irreducible components of V y are defined over F q . 

It follows from CDM, Pr. 1.7] that V y can be defined as the zero set of N polynomials in 
F g pTi, . . . , X n ] of degree ^ E, and moreover V has at most I absolutely irreducible components, 
where N, E and / depend only on W/Z (in particular are independent of y). 

By (i), we have for all y G F™ 

S(y,W,F q ) = S(V y ,F q ). 

Note it is possible that V y = (for instance for X 2 + 1 = if —1 is not a square in F q ); if that 
is the case, we can take B\ = B2 = 0, and we put 5(y) = 0. So assume V y is not empty. 

Fix a prime I 7^ p, where p is the characteristic of F q . The formalism of the Lang torsor (see 
e.g. |D1| Sommes trig. 1.4], jK2| 4.3] or the sketch in |IK| 11.11]) provides us with the lisse 
Q^-adic sheaf of rank 1 

C = C W) ® C x{g) 

on V y (which depends on p, ip, x, / and g) such that the local trace of a geometric Frobenius 
element Fr^p, at a rational point x S V y (F q ) is given by 

(22) Tr(Fr Xj F 9 | C) = ^(f(x)) X (g{x)) 
and therefore 

S(V y ,F q )= Y, Tr ( Fr -,F 9 I C). 

XGVy(F q ) 

Let V y denote the base change of V y to the algebraic closure of Fq. The Grothendieck- 
Lefschetz trace formula (see e.g. |D1| Rapport]) gives the cohomological expression 

S(V y ,F q ) = ^(-1)*^ | Hl(V y ,£)) 

where H l c are the compactly supported £-adic cohomology groups with coefficient in the sheaf 
C and Fr denotes the geometric Frobenius operator over F q which acts naturally on those finite 
dimensional Q^-vector spaces. Of course, the sum is in fact finite, and more precisely the space 
H l c vanishes for i > 25(y), where 5(y) is the maximal dimension of an irreducible component of 
V y . Because V y is a subset of affine n space, we have 5(y) ^ n. 

By Deligne's fundamental result (his far-reaching generalization of the Riemann Hypothesis), 
since the sheaf C is punctually pure of weight (its local traces being roots of unity), the 
eigenvalues of Fr acting on H l c (V y ,C) are </-Weil numbers of weight ^ i (see |D2| Th. 3.3.1]). 

Thus we obtain (|19|) in the case C = with the family of (/-Weil numbers otj{y) being 
the eigenvalues (indexed by j, and with multiplicity) of the geometric Frobenius on all the 
cohomology groups H l c (V y , Q^) for i ^ 25(y), those becoming "signed" by the factor (— 1)\ 

In this case still, for i = 25(y), let U y be an irreducible component of V y ; by (ii) above, 
U y = Uy x F g for some irreducible U/F q . By Poincare duality (see |D1| Sommes trig., Rem. 
1.18d]), we have 

H?W(U v ,Q t ) = (Q,) 7ri(I/; _ ) (-25(y)) = (Q e )(-25(y)) 
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where U' C U y is a smooth, dense open subscheme of U y , and TTi(U' y ,r]) is the geometric 
fundamental group of V y with respect to some geometric point rj. This precisely means that 

Hc & ^ is of dimension 1 and the geometric Frobenius Fr of F q acts by multiplication by q s ( y \ 
(The only delicate point is that if we use an irreducible component not defined over F q , it is 
the Frobenius Ft" of a field on which it is defined that acts by multiplication by q uS ( y >). 

Since moreover it is known that H% \v y , Q^) is the direct sum of the corresponding coho- 
mology of the irreducible components, it follows that all eigenvalues of weight 25{y) are equal 
to q 2S ( yS) and that the multiplicity is the number of irreducible components of dimension 25 (y) 
(see |D11 Sommes trig., Remarques 1.18 (d)]. This gives all the properties of ay stated in the 
first part of Proposition 

Similarly in the general case we obtain (|2"U)) with the analogue eigenvalues for H l c {V y , £), and 
the weight of these is ^ 25 (y) as stated. 

A crucial point that remains to be checked is that the total number of eigenvalues (i.e. the 
numbers denoted (3 and 7 in the statement of the Proposition) is indeed bounded by B\ or B2 
depending only on W and (in the case of B2) on the degrees of fx, fi, gi, 92- 

This is precisely given by a very useful result of Katz |K1| Th. 12], because we can embed V y 
as a closed subscheme of an affine space A|f where M depends only on W/Z, using equations 

of degree and number bounded only in terms of W/Z (by the uniformity of the decomposition- 
intersection procedure described at the beginning). 3 

Now we analyze when we can get some cancellation in (|2U|). We assume that ifi is non-trivial 
and that max w(/3j(y)) = 25{y) and will show then that (|21jl holds for some c. 

The hypothesis implies that for some irreducible component U y = U y x F q of V y , the coho- 

mology group Hc S ^ y \lI y , C) does not vanish (since all H l c with % > 26 (y) do vanish and those 
with i < 25 (y) yield Weil numbers with smaller weight). 

If need be, we replace U y without changing notation by a smooth non-empty open subscheme. 
Since C restricted to U y is a lisse sheaf on a smooth connected scheme, we have by Poincare 
duality (as before) the co-invariant formula 

(23) H?^{U y ,C) = (C v ) MVyjV) (-26(y)) 

for any geometric point 77 of U y , C v being the fiber of C at rj. Since C is of rank 1 and the 
cohomology does not vanish, we must therefore have 

(A). l( l7 !/ ,,)(-2<5(y))=A(-2%)), 

i.e., the sheaf C is geometrically trivial. From the exact sequence 

1 -» 7i"x (Uy ; ff) -» Trx(Uy,ri) -» Gal(F ? /F 9 ) -» 1, 

and (|23|) . it follows that C on U y "is" a character of iri(U y ,r]) which comes from a character of 
Gal(Fq/F g ). This means in particular that the local trace of C at the geometric Frobenius of a 
point of U y {F q ) is constant i.e. (by (|22jl). for all x,xo G U y (F q ) we have 

4>{f{x))x{g(.x)) = 4>{f{xo))x{g(.xo))- 

Taking the D-th power (where D is the order of x) yields the equality 

(24) t/>(D(f{x)-f(x ))) = l. 

We can write ift(x) = e(Tr(ox)/p) for some a G , since ip is assumed non-trivial. Then (|24jl 
and the injectivity of x 1— > e(x/p) on F p mean that for all x £ U y (F q ), the element aDf{x) 
is in a coset of the kernel of the trace from F q to F p . Such a coset has cardinality p _1 |F g |, 
but on the other hand if / is non-constant on U y , and if q is large enough, there are at least 
qj max(deg(/i), deg(/2)) elements of the form aDf(x) in F q . (Note D = 1 (modp).) 



It may seem surprising, but it is indeed true that B2 is independent of the order of the multiplicative 
character \. 
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So under the hypothesis that w{y) = 25 (y), we see that if p is large enough (depending on 
W and the degrees of f\ and $2) it must be the case that f\U y is constant. Since the number 
//(y) of irreducible components of V y of dimension 5{y) is bounded in terms of W/Z only, the 
standard counting 

\W y (F q )\=n(y) q s M + 0(q s W- 1 / 2 ) 
(with absolute constant depending only on W/Z) coming from (fTH|) and its analogue 

\U y (F q )\ =g%)+0(g%)-V2) 

show that (|21j) holds for c the constant value of / on U y (F q ), p large enough in terms of W/Z. 
(We could also have argued more geometrically using the triviality of the Artin-Schreier covering 
associated to C, as in |D1| p. 24, last §]). □ 

Remark 11. There is presumably a cohomological interpretation of the decomposition-inter- 
section process, which means essentially a general analysis of the cohomology of W/F q with 
coefficient (at least) in a sheaf of the type C above, in the case where W is not absolutely 
irreducible, explaining how the trace formula boils down to that of the variety obtained by the 
process. However the author has not found such a result in the literature. 

A drawback of using this procedure is that the Weil numbers involved are not uniquely 
determined by the exponential sum S(y,W, F q ) and its companions 

(25) S„(y,W,F q )= ^f(x))x(Ng(x)) 

(where Tr and N are the norms from F q » or F*„ to Fq), rather they are uniquely determined 
by the sequence of sums 

S u (V y ,F q ,)= ^f(x))x(Ng(x))= £ e^y^M" 

for v ^ 1, in the sense that the multiplicity (with signs taken into account) 

of any g-Weil number a is determined by (S u (V y ,F q )). However, those sums for v ^ 2 are not 
necessarily related to the original sum S(y, W, F q ). 

One can ignore all this, in a sense, and apply the Grothendieck-Lefschetz trace formula and 
Deligne's Theorem to W y directly. But then the computation of the eigenvalues for the top- 
dimensional cohomology is not valid unless d\mW y /F q = 5(y); note that this will often be 
the case for a concrete W/Z (e.g. if W/Z is generically absolutely irreducible), but in our 
applications to definable sets, the auxiliary varieties are not so well controlled, especially as 
they depend on the value of the auxiliary parameters x'. 

We come back to a general formula ip(x,y) and start with the special case / = 0, g = 1, 
ip = 1, x = 1 that counts the points of the definable sets. 

Theorem 12. Let (p(x, y) be a first-order formula in the language of rings and let K and e 
be given by Theorem^ for ip. There exists an integer B\ ^ 0, depending only on ip, with the 
following property: for all q large enough and all y £ F™ there exist signed q-Weil numbers 

a K)i) i(y), . . . , a K:i:f3 (y), with 1 < k < K, 1 < % < e, 

with (3 ^ B\, P possibly depending on y, such that 

i»(® K ,i,j(y)) < 2n, vaaxw(at Kt ij(y)) = 25 (y) is even, 

<*K,iAv) = <i 5(y) '/ "•!",,,,(//);• = 2<%), 
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and we have 

(26) HF q ,y)\= Y, E E e(a K «(2/))a KiM (y). 

Finally, ifip(F q ,y) is not empty, the multiplicity "up to sign" of a = q 2S ^ is strictly positive, 

m»)= EEE ( ""' 4 'f' ,ll)l >' 

" Kll ,i(s/)=g M(a) 
We thus recover the main theorem of [H*DM . 

Corollary 13 (Chatzidakis-van den Dries-Macintyre) . (1) For all finite fields F q and y G F™, 
we have 

\<p(F q ,y)\ = ti(y)q s W + O^)^) 

where 

EEE ^t^Q 

w(ot K ,i,j(y))=28(y) 

is > unless ip(F q ,y) = 0, and £/ie implied constant depends only on (p. 

(2) There exist only finitely many pairs {d, fi) with d ^ and /i£Q which arise as (5(y), fi(y)) 
for some finite field F q and y G F™. 

(3) For eac/i pair (d,fi), d ^ an integer and // G Q, i/iai can arise as (5(y), /j,(y)), there 
exists a formula Cd^p in the language of rings, depending only on d, (jl and if, such that for 
any finite field F q , we have F q |= Q,^ )¥ , if and only if (6(y), fj,(y)) = (d, //). 

Proof. (1) For q large enough, this is obvious from (|2(i|) (with f = 1) and the stated properties 
of the weights of the a K ij(y), recalling that the sign e{q 2& ^) is +1 by convention; in fact we 
get 

\tp{F q ,y)\-n{y)q Siy) \ ^3KB 1 q 6 ^- 1 / 2 

for q go large enough so that (|26j) applies. 

Replacing the constant 3KB\ by max(3i^i?i, C) where C ^ satisfies 

|^(F 9 ,y)|- M (y)^)| ^C^)- 1 ^ 

for g ^ qo, we obtain the result for all q. 

(2) This is clear since we have the trivial bound \<p(F q , y)\ ^ q n for all n which gives 5(y) ^ n, 
and because there are at most 2 K+e+Bl choices of subsets of summation of indices (k, i,j) that 
can occur in defining fj,(y). 

(3) This is proved in |CDM| Prop. 3.8] (and can be guessed from the proof of Theorem El 
and Proposition □ 

We refer to |CDMl 4.9,4.10] for intrinsic interpretations of the "dimension" S(y) and the 
"measure" fi(y) in the context of pseudo-finite fields. 

We can now come back to the general exponential sums (jHJ) as we are in a position to compare 
the estimates obtained with the number of points of summation. 

Theorem 14. Let (/i,<?i, fii9i, {V 1 }) {%}) ^ e the data defining a family of exponential sums (jfijl 
over the definable sets <p(F q ,y). 

(1) There exists an integer B2 ^ 0, depending only on ip and the degrees of f\, g\, f2, g2, 
with the following property: for all q large enough and all y G F™, there exist signed q-Weil 
numbers 

a K ,i,i{y)i •"■> a K,i,f3(y), with 1 < k ^ K, 1 < % < e, 
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for j3 ^ B-2, (3 depending possibly on y, such that 



w(a K:i j(y)) < 25(y), for all k, i, j 



and 



(27) S(y )( p,F q )= E Ml— E e(a K ^{y))a K ^{y), 



hence 



(2) Let 



|S(y,p,F,)| < 3KB 2 q w ^/ 2 = 3K 2 Bq 5 ^-^ 2 . 



w(y) = maxw(a^i j(y)) < 2<%), 



denote the maximal weight of the Weil numbers occurring in this decomposition, and 7(y) = 
26(y) - w{y) > 0. 

There exists r\ > depending only on ip(x, y) such that for p is large enough, depending only 
on (p(x,y) and the degrees of fi and f 2 , we have w(y) < 25 (y), i.e., 7(2/) > ; unless ip is trivial 



or there exists some c E F q with 

E i>* E !• 

x€tp(F q ,y) x€<p(F q ,y) 
f(x)=c 

Remark 15. (1) We emphasize that in part (2), we must have p large enough, and not only q. 
This is necessary even for classical sums, since if we fix the additive characters by defining ifi on 
F p v as e(Tr(x)/p), we have ip{x p — x) = 1 for all x £ F p v, so that for a polynomial / congruent 
modulo p to a polynomial of the form g(x) p — g(x), we have 

E */>(/(*)) =p v 

for all v ^ 1. And of course, such congruences can a priori hold for a large number of distinct 
primes. So our statement only provides a criterion for cancellation in the "horizontal" direction 
p — > +00 which is of most interest in analytic number theory. 

(2) Note that of course it is only when 7(2/) > that this has any interest. The condition 
stated to ensure this does not sound particularly convenient, but that is partly because of 
the generality allowed. Essentially, for additive character sums, it says that there is some 
cancellation in S(y, (p, F q ), unless it turns out that / is constant for a positive proportion of the 
points of summation. This is a generic property; one may say that it corresponds to ip(F q ,y) 
being "transverse" in some sense to the level sets of /. In concrete applications, it should be 
clearer how to check this condition. 

Proof of Theorem [TM and Theorem\14\ By (fT^]) with (5{x) = i)(f(x))x(g{x)), we are reduced to 
the sums over the F g -rational points of the fibers of 7r K) j : W K % — > A n , which are algebraic 
varieties. We consider each in turn, assuming q ^ q$ as in Theorem |H] and always consider that 
a choice of the auxiliary parameters x' has been performed, which we incorporate into y for 
clarity. In any case we apply Proposition ^] to each (with y replaced by (x',y)) in turn; 
since at most eK such varieties occur, and both e and K are determined by cp only, the total 
number of Weil numbers that will occur is bounded in terms of (p (for the counting problem) or 
<p and the degrees of the functions / and g in the general case. 

The maximal weight is the maximal value of the 25 K i(y) of Proposition 1101 applied to W K i- 
Denoting it 25 (y) it follows that each of the Weil number occurring with this weight is in fact 
equal to q 5( - y \ 
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The remainder of the statement of Theorem ^] is clear except maybe that in Theorem ^] 
the multiplicity 



rtv)= 2J2J2J 



is > if the set <p(F q ,y) is not empty. But otherwise the number of elements of (f(F q ,y) 
would be <C q s (y)~ 1 / 2 J with implied constant depending on 93 only, while, for any k, i such that 
$K,i{y) = 8(v)i the number of x € W K ^ y (F q ) is at least itj^W if q is large enough (in terms of ip 
only), and the images t k j(x) yield at least g|Wre,i,2/(Fg)l elements of ip(x,y) (see (fTT)l and the 
property of e following, or look back at the proof of Theorem [SJ) . Hence we do get the result 
stated in Theorem 1121 for all q large enough. 

There remains to examine the condition stated in Theorem ^] for the exponential sum to 
have maximal weight w(y) < 5(y). For this, assume w(y) = S(y). If ip is non trivial, then (|21[) 
must hold for some k, i (with 77 > depending only on tp(x,y)) with S K< i(y) = S(y), i.e., with 

\W K ^ y (F q )\^ m \^(F q ,y)\ 

where again 771 > depends only on (p(x,y). This gives part (2) of Theorem 1141 □ 

We state finally the following corollary: 

Corollary 16. Let fii92i {x}) ^ e ^ e data defining a family of exponential sums 

over the definable sets (f(F q , y), and let 7(2/) 6e as in the previous statement. There exists 77 > 
depending only on ip(x, y) such that for ip non-trivial we have 

5(y,^,F p )«/^- 1 / 2 

for all primes p and all y £ F™ /or which there does not exist c E F p such that 



E 

f(x)=c 



The implied constant depends only on <p(x,y) and the degrees of f\, f%, g\, 02. 

Proof. This is immediate after enlarging if necessary the constant arising from estimating the 
exponential sum using (|27|) and 7(7/) > for p large enough under the condition stated. □ 

The sample statements given in the previous sections are special cases of this corollary. In 
Theorem ^ (with one variable, no parameter, ip(x) = e(x/p) non trivial), either 5(y) = in 
which case (JHJ) is trivial, or 5(y) = 1, and if / is a non-constant rational function modulo p, it 
can not be constant on a set in Z/pZ containing 3> p points. If / is constant modulo p, the 
estimate is again trivial as p \ N in that case. 

Theorem [21 on the other hand is just a rephrasing of the corollary. As for Theorem |21 we are 
considering the sums 

V- , f (f{x) + (h,x)- 

ze v (F p ) ' P 

with (h, x) = hiXi. Note the degree of fh = f+(h, •) is uniformly bounded for all h. Consider 
an h where the gain for this sum is 7^ = 0. After performing the reduction of Theorem |S] for 
<p(x), it is clear from the proof of Theorem 1141 above that there is a fixed 77 > and a finite list 
of subsets &i of ip(F p ) (finite list with cardinality ^ D where D depends only on (f(x)) such 
that if 7/j = 0, then / is constant (say = c) on one of the 3>j. 

Fixing one which we can assume is not reduced to a single point (otherwise the desired 
estimate is trivial), the values of h corresponding to this <I>j are such that $j is contained in an 
affine hyperplane "parallel to h" , i.e., with equation (x, h) = constant. Simply by linear algebra, 
this means that <I>j must be in some intersection V of such affine hyperplanes, and the assumption 
that <I>j has more than one element means that the minimal such V is of dimension k ^ 1; then 
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one sees that h must lie in a "complementary" affine subspace of dimension n — k < n. Hence 
there will be at most Dp" 1-1 exceptions with 7^ = 0. (Note that if no $j is in an hyperplane, 
we in fact showed that there are only finitely many "bad" values of h). 

Remark 17. The reduction theorem and the subsequent expansions of exponential sums in terms 
of Weil numbers suggest an intriguing question: how intrinsic are those decompositions? Can 
one define some kind of cohomology theory for certain sheaves on definable sets over finite fields 
in such a way as to obtain formulas like (|27[) as consequences of a trace formula operating 
directly at that level? Since it seems that a condition q qo is necessary, this may be better 
dealt with at the level of pseudo-finite fields. We hope to come back to such foundational issues. 

Finally, the author (at least) can't help wondering if this introduction of some ideas of logic 
and model theory might not be one clue to the hypothetical theory of "exponential sums over 
Z" that Katz has written about, for instance, in |K3j . 

More down to earth, one may hope that exponential sums estimates for non-trivial characters 
could be useful in other areas, noting for instance that Corollary of Chatzidakis, van den 
Dries and Macintyre has had applications in model theory (but the author doesn't understand 
those) and, in remarkable work of Hrushovski and Pillay HPJ, in the study of algebraic groups 
over finite fields (in particular the behavior of the reduction modulo p of a group defined over 
Z for large p). 

7. Application: definable intervals 

We come back to general exponential sums still with f(n) = g(n)/p for some prime p, 
but now over a short interval: 

S p ,#(g)= Yj e 2m9(n)/p where N = / 

0<n^N 

for some $ g]0, 1[, the inequality $ < 1 being characteristic of a "short" interval. 

As long as t? > ^, a well-known technique of Fourier completion (see e.g. |IK1 12.2]) leads 
to complete sums (i.e., with -d = 1), for which the results of algebraic geometry can often 
by applied, giving a bound of size roughly ^/p(logp), so an estimate for S p ^(g) with saving 
0(N) = N 1 - 1 /^ -» +cxd (see ©). 

However, very few cases have been handled when i? ^ ^. Since the most successful high-level 
approach to exponential sums in general has been the insertion of a given sum in a sort of 
"family", 4 of sometimes seemingly unrelated sums, and then exploitation of properties of the 
family as a whole to derive individual results, one may hope to do so by analogy with the case 
of complete sums over finite fields by extending S p ^(g) in some way to all finite fields F q . 

So the following question is of interest: can one "lift" the short intervals ^ x < p® to subsets 
of Y p v in such a way that the corresponding companion sums to S p (g) have a sufficiently rich 
structure to become more accessible? 

There are of course many ways to envision such a lifting, but the most optimistic version is: 
are short intervals definable by a uniform formula ip(x) in one variable in the language of rings? 

More generally, let <p(x) be such a formula. The question is: when is it the case that </?(F p ) 
is an interval for almost all p, i.e., when is it the case that for all but finitely many p there exist 
integers a p ^ b p for which 

(p(F p ) = {x (modp) I x = i (modp) for some integer i, a p ^ i ^ b p }. 

Of course, given a fixed interval / = {n, n + 1, . . . ,n + m}, n ^ 0, the formula 

tp(x) : (x — n) ■ (x — (n + 1)) • • • {x — (m + n)) = 



Not only for sums over finite fields: the strategy of the standard methods of Weyl, van der Corput and 
Vinogradov can also be understood in this manner. 
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is such that <p(F p ) coincides with the reduction of I modulo p for all p > m. So of course 
similarly the reduction of the interval {n, n + 1, . . . ,p — 1} of length p — n is defined by 

ip(x) : x ■ {x — 1) • (x — (n — 1)) ^ 0. 

We can easily use the tools of the previous sections to show that those are essentially the 
only possibilities. 

Proposition 18. Let <p(x) be a formula in one variable in the language of rings. If neither 
t/?(F p ) nor ^(p(F p ) are bounded for all primes, there are only finitely many primes such that 
tp(F p ) is the reduction modulo p of an interval. 

Proof. By the Main Theorem of |CDM| (or Theoremll2() applied to ip and —up, there are constants 
A ^ 1, C ^ and finitely many rationals /Uj e]0, 1[, such that for each prime p either 

|V(F P )| ^ A or |-.p(F p )| ^ A 

or 

\cp(F p )\ -mp < CVP- 

for some i. If p runs over a subsequence of primes (tending to +oo) for which (p(F p ) is an interval, 
and is unbounded, the fmiteness of the set of ^ shows that for some i a further subsequence 
exists for which <p(F p ) is an interval of length |(/?(Fp)| ~ [iip. We will show this is impossible. 
For this we observe first that by Theorem ^ we have 

(28) ^ 



E 



for all primes, the implied constant depending only on <p. 

Let now p be such that (p(F p ) is an interval, say (p(F p ) 
a geometric progression gives 



{m p , . . . , m p + n p — 1}. Summing 



E ■ 



P 



m, 







sin 

P 


'-«(;) 




7T 

sm — 

P 



For the given hypothetical subsequence p —* +oo, we have n ? 



/iiP, so the 



numerator converges to sin7r^j ^ (since fa ^ {0, 1}). On the other hand the denominator is 
equivalent to n/p, hence we find that 



< 29 > I E v 

zg</?(F p ) 

which contradicts (|28j). This concludes the proof. 



ir \ sin irni\p, 



□ 



Remark 19. Theorem ^ shows more generally that as p 
unbounded, we have 

1 



lim 

p^+oo \ip(F p ) 



E <f 

x£<p(F p ) 



+oo along any sequence with \(p(F p ) 
= 0, 



for any fixed integer h ^ 0. 

Hence, using Weyl's equidistribution criterion, it follows that the fractional parts {^} for 
x S <p(F p ) become equidistributed in R/Z for Lebesgue measure as p runs over primes with 
|y>(F p )| — > +oo, i.e., for every continuous function / : R/Z — ► C, one has 

1 ^ /( 



lim 

P-++00 \ip(F p 



E 

xG V (F p ) 



/(0)d0. 



R/Z 



Note that such a result is well-known for algebraic sets (i.e., formulas without quantifiers); for 
a much more general statement of Fouvry and Katz, see |FK| . 
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Also, the proposition extends immediately to general arithmetic progressions : if <p(x) is a 
formula with one variable in the language of rings, then <p(F p ) can be for infinitely many primes 
of the form 

A a ,q,k = {a,a + q,...,a + n p q} (modp) 
for some integers a ^ 1, q (which may depend on p), with p \ q, only if either <p(F p ) or its 
complement is bounded for all p (the case of the complement can only occur for q = 1 of 
course). Indeed, one need only compute the exponential sum 

(where q is the inverse of q modulo p), to obtain an analogue of (|29j) . 

Appendix: notation index 

For the reader's convenience, here is a list of the notation that occur in the reduction theorem 
and the decomposition theorem for exponential sums, with a brief explanation of their meaning. 



n 


TVT u„ \ o ~C ~ -11 • / \ 

JNumber ^ U ot variables in <p{x, y) 


m 


Number ^ of parameters in tp(x, y) 




Theorem |S] (integer ^ 1, number of disjunctions expressing ip) 


s 


Theorem IH1 ( integer ^ 0, number of auxiliary parameters needed) 


X 


Value or variables for the auxiliary parameters 


K 


Index running from 1 to K 




Formulas in the disjunction expressing ip 


K 


Proof of Theorem 151 (integer ^ 0, number of existential terms in <f> K ) 


T 


Proof of Theorem|HJ (integer 0, number of terms in formulas $k) 


/«,• 


Terms occurring in $ K 


h Kj . 


Terms existing in existential form in <J> K 


* K 


Proof of Theorem [S] (auxiliary quantifier free formulas) 


<7o,Po 


Generically, value of q or p so that a statement holds for q ^ qo or p ^ po 


e 


Theorem [H] (integer ^ 1, maximal number of pre-images for a given x S ip(F q ,y)) 


hj 


Indices running from 1 to e 


w Kji 


Affine schemes projecting "to (p" 


7T re ,i 


Projection from W K i to space of parameters 




Projection from W Kl i to definable sets 


e(a) 


Paragraph before PropositionllOl ( "sign" of a Weil number) 


B t 


Theorem 1121 (maximal number of Weil numbers for the counting function) 


B 2 


Theorem 1141 (maximal number of Weil numbers for exponential sum) 


a 


Theorems 1121 and 1 141 (number of Weil numbers occurring for given parameter y) 


3 


Theorems 1121 and 1141 (index running from 1 to f3) 




Theorems 1121 and 1 1 41 ( Weil numbers giving exponential sum) 


%) 


Theorem 11 21 (maximal weight of Weil numbers in counting points) 


t*(y) 


Theorem H*2l (density of point counting) 


w(y) 


Theorem 1141 (maximal weight of Weil numbers in exponential sum) 




Theorem 1141 (gain in bound for exponential sum) 
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